Semantic principal video shot classification via mixture Gaussian

نویسندگان

  • Hangzai Luo
  • Jianping Fan
  • Jing Xiao
  • Xingquan Zhu
چکیده

As digital cameras become more affordable, digital video now plays an important role in medical education and healthcare. In this paper, we propose a novel framework to facilitate semantic classification of surgery education videos. Specifically, the framework includes: (a) Semantic-sensitive video content characterization via principal video shots. (b) Semantic video classification via a mixture Gaussian model to bridge the semantic gap bwteen low-level visual features and semantic visual concepts in a specific surgery education video domain.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic video classification with insufficient labeled samples

To support more effective video retrieval at semantic level, we introduce a novel framework to achieve semantic video classification. This novel framework includes: (a) A semantic-senstive video content representation framework via principal video shots to enhance the quality of features (i.e., the ability of the selected low-level multimodal perceptual features to discriminate among various se...

متن کامل

TokyoTechCanon at TRECVID 2012

We aim at developing a high-performance semantic indexing system using Gaussian-mixture-model (GMM) supervectors and tree-structured GMMs [1, 2, 3]. GMM supervectors corresponding to six types of audio and visual features are extracted from video shots. Tree-structured GMMs reduce the computational cost of maximum a posteriori (MAP) adaptation for estimating GMM parameters while keeping accurac...

متن کامل

Semantic Shot Classification in Sports Video

In this paper, we present a unified framework for semantic shot classification in sports videos. Unlike previous approaches, which focus on clustering by aggregating shots with similar low-level features, the proposed scheme makes use of domain knowledge of specific sport to perform a top-down video shot classification, including identification of video shot classes for each sport, and supervis...

متن کامل

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....

متن کامل

Global Journal of Computer Science and Technology

Rapid growth in multimedia technologies facilitates the acquisition and storage of videos in a cost effective manner; leads to the processing of ginormous videos. However, for effective processing, suitable search methodologies are essential pre-requisite in any video processing system. In this paper, we propose a proficient content-based video retrieval system with the aid of extensive feature...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003